Parsing and Productivity¹
Abstract
It has often been argued that the (type or token) frequency of an affix in the lexicon cannot be used to predict the degree to which that affix is productive. Affix type frequency refers to the number of different words which contain an affix; token frequency refers to the summed lexical frequency of those words. The observation that neither of these counts relates straightforwardly to productivity raises difficult questions about the source of different degrees of productivity, making the nature of morphological productivity one of the “central mysteries of word-formation” (Aronoff 1976:35). If productivity does not arise as a function of frequency, then where does it come from? This paper argues that frequency and productivity are, in fact, intimately linked. Type and token frequency in the lexicon are not good predictors of productivity, but frequency counts of decomposed forms in the lexicon can predict the degree to which an affix is likely to be productive. The problem with a straightforward frequency count of forms containing an affix is that not all affixed forms contain the affix to the same degree. Some affixed words are highly affixed and highly decomposable (e.g. tasteless). Other affixed words appear more opaque, and tend to be characterised by whole-word access rather than parsing (e.g. listless). We argue that the former set facilitates productivity much more strongly than the latter set. Decomposed forms in the lexicon arise from parsing in perception. By coming to a clear understanding of the types of factors which tend to lead to parsing in perception, then, we can predict the degree to which an affix is represented by decomposed forms in the lexicon, and so (we argue) the degree to which it is likely to exhibit productivity.
Thus, we argue that there is a strong relationship between parsing in perception,

¹ We are indebted to Andrew Carstairs-McCarthy, Wolfgang Dressler, Anke Luedeling, Janet Pierrehumbert, Ingo Plag and Robin Schafer, whose comments have greatly improved the quality and coherence of this paper. Remaining errors or incoherencies are the sole responsibility of the authors.
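
The counts contrasted in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the toy lexicon, the frequency values, the `decomposed` flag and the function name `affix_counts` are all assumptions made for illustration, and matching an affix with `str.endswith` is a crude stand-in for real morphological analysis. Only the treatment of tasteless as decomposable and listless as opaque comes from the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    word: str
    frequency: int    # lexical (token) frequency of the word; invented numbers
    decomposed: bool  # assumption: True if the word tends to be parsed in perception


def affix_counts(lexicon, suffix):
    """Return plain and decomposed-only frequency counts for a suffix."""
    # Crude assumption: a word "contains" the suffix if it ends with that string.
    affixed = [e for e in lexicon if e.word.endswith(suffix)]
    parsed = [e for e in affixed if e.decomposed]
    return {
        # Type frequency: number of different words containing the affix.
        "type_frequency": len({e.word for e in affixed}),
        # Token frequency: summed lexical frequency of those words.
        "token_frequency": sum(e.frequency for e in affixed),
        # The same two counts restricted to forms flagged as decomposed.
        "decomposed_types": len({e.word for e in parsed}),
        "decomposed_tokens": sum(e.frequency for e in parsed),
    }


# Hypothetical toy lexicon; only the tasteless/listless contrast is from the text.
TOY_LEXICON = [
    Entry("tasteless", 40, True),
    Entry("listless", 25, False),
    Entry("hopeless", 120, True),
    Entry("table", 300, False),
]

counts = affix_counts(TOY_LEXICON, "less")
print(counts)
```

In this toy setting the plain counts treat tasteless and listless alike, while the decomposed-only counts exclude listless, which is the distinction the abstract draws between counting all affixed forms and counting only decomposed ones.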